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Abstract — We investigate joint network and channel coding 
scliemes for networks wlien relay nodes are not capable of per- 
forming channel coding operations. Rather, channel encoding is 
performed at the source node while channel decoding is done only 
at the destination nodes. We examine three different decoding 
strategies: independent network-then-channel decoding, serial 
network and channel decoding, and joint network and channel 
decoding. Furthermore, we describe how to implement such joint 
network and channel decoding using iteratively decodable error 
correction codes. Using simple networks as a model, we derive 
achievable rate regions and use simulations to demonstrate the 
effectiveness of the three decoders. 



I. Introduction 

Classically, communication over a network involves net- 
work nodes whose sole function is the routing of packets. 
Recently, however, Ahlswelde et al. HI observed that by allow- 
ing the intermediate network nodes to combine information, a 
greater network throughput can be obtained. This strategy is 
referred to as network coding. 

In much of the literature on network coding, each link is 
assumed to have its own independent channel coding system 
and hence each link is assumed to be error-free. Indeed, 
for certain independent memoryless networks, separating the 
channel and network coding in this way guarantees asymp- 
totically optimal error correction IS), ||3l. However, for other 
networks, examples have been given showing that, in general, 
network and channel coding must be performed jointly to 
achieve the best performance |4|. 

Several authors have investigated this form of combining 
channel and network coding for the wireless relay channel. Use 
of iteratively decodable codes such as turbo codes Q, ||6l and 
low-density parity-check (LDPC) codes tTJ, QOl, Ell 

are also common. A common feature in all of these schemes 
is that the relay node decodes each packet prior to performing 
network coding. In addition, |6| and [12| include automatic 
repeat requests (ARQ) as another layer of error protection. 

Recently, |13| and |14| have investigated networks where 
the network nodes have differing capabilities. In particular, 
041 considers a hierarchical network where sensors have 
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limited computing and communication capabilities and inter- 
mediate relay nodes, which communicate to a central server 
or access point, are more capable. On the other hand, ifTSll 
looks to minimize the capabilities required by network nodes, 
proposing networks where not all nodes necessarily perform 
coding functions. 

In this paper we investigate the combination of channel and 
network coding in a simple cooperative network with noisy 
network links. Similar in spirit to lfT3l and lfT4l . we assume 
the intermediate nodes have limited computing capabilities. In 
particular, we assume the intermediate nodes do not perform 
channel coding operations. Rather the nodes simply forward 
packets or perform the operation of XOR'ing two incoming 
packets. This differs from the traditional network coding 
strategy of decoding each packet at each node, XOR'ing the 
messages and then re-encoding the result. In our networks we 
only perform end-to-end channel coding, all channel encoding 
operations are performed at the source and all decoding 
operations are done at the destination. We model each link 
in the network as a binary symmetric channel as we assume 
each node makes a hard decision on its received signals. 

We investigate three decoding strategies: independent 
network-then-channel decoding, serial network and channel 
decoding, and joint network and channel decoding and illus- 
trate these strategies using LDPC codes. LDPC codes have 
been proposed for many network based applications including 
relay-networks Q, lITSl and sensor networks Ill6l . Note that, 
unlike the schemes in fl] and fT5l|, the messages in our 
networks are only encoded by the channel code once at the 
source(s) and decoded once at the destination node. Unlike the 
schemes in fT6\ the sources are not correlated. 

In Section |ll] we describe the three above-mentioned de- 
coding strategies. In Section III we derive achievable rate 



regions for each decoding strategy on a simple network to 
demonstrate the advantage of joint decoding for cooperative 
networks with noisy network links and end-to-end channel 
coding. In Section IV we describe how iterative decoders 



for low-density parity-check (LDPC) can be constructed in 
practice for each decoding strategy and provide simulation 
results showing the benefits of the proposed joint network and 
channel decoding strategy. 

II. Independent, Serial, and Joint Decoding 

Suppose the source(s) generate binary message vectors 
Ui, . . . , U5 which are each encoded with the channel codes 
Ci, . . . ,Cs, respectively (however the same code can be used 
for some or all of the messages). For simplicity, we will 
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assume that all codes are of the same length and each 
packet contains a single codeword. The generator matrices for 
the codes Ci, . . . , C5 are given by Gi, . . . , respectively 
and so the codewords for messages Ui,...,U5 are thus 
Ci = UiGi, . . . ,cs — U5G5, respectively. The noisy versions 
of the codewords received at the destination(s) are labeled 
Ci, . . . , £5, respectively and we will write Ci, . . . , C5 for the 
decoded codewords. Packets which contain the modulo-2 sum 
of two or more (noise-corrupted) codewords are produced by 
the low-complexity intermediate network nodes which employ 
network coding to improve the throughput of the network. We 
will write Ci,j for CiQCj, where represents a bit-wise XOR 
(or bit-wise modulo-2 addition), and thus for the noisy 
received version of j. We assume the destination(s) know 
which codebook Gi, G5 generated the original codewords 
and how packets have been combined while traversing the 
network; e.g., via the use of a packet header attached to each 
packet. 

The aim of this paper is to decode the messages Ui , . . . , U5 
when the destination has noisy versions of one or more of 
these combined packets and may also have noisy versions 
of one or more packets containing original codewords. The 
approaches we consider are independent network and channel 
coding, serial network and channel coding and joint network 
and channel coding. 

A. Independent network-then-channel decoding 

The throughput benefit of the network code will be realized 
simply by performing network decoding at the destination to 
recover noisy versions of the transmitted codewords before 
independently decoding each codeword with its corresponding 
error correction code. 

For example, a destination node which receives Ci, 62 and 
on three incoming links will calculate c?3 = ® c^. 
The original messages can then be found by using channel 
decoding on 61 to obtain Ci and separately decoding 62 to 
obtain £2 and £3 to obtain £3. All three channel decoders can 
be run in parallel to improve the speed of the decoding at the 
destination. However, this strategy is clearly suboptimal as the 
noise in £^ will be carried into the calculation of £3. 

B. Serial network and channel decoding 

A potential improvement over independent network and 
channel coding is to employ serial decoding, where decoding 
is performed on the packet received on one of the incoming 
links and the decoded information from this first link is shared 
with the decoders for the packets received on the remaining 
links. This is repeated one link at a time until the all of the 
packets are decoded. 

The obvious strategy is that channel decoding is first per- 
formed on the packets corresponding to an original codeword. 
Network decoding is then applied using the decoded code- 
words and the received combined packets (i.e., network-coded 
packets) to obtain noisy versions of the remaining codewords. 
These are then decoded with their respective channel decoder 

For example, a destination which receives £1^, £2 and £1^3 
will decode £2 to obtain £2, then calculate £1 = £2 ® Ci^. 
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Fig. 1 . Channel inputs, cliannel outputs, and cliannel transition probabilities 



Then £1 is decoded by its channel decoder to obtain £1 
and £3 = £1 © £1^. Finally £3 is separately decoded by 
its channel decoder to obtain £3. This method can improve 
the performance of the channel decoding over independent 
schemes, as we will see in Sections [V] and [111] but increases the 
decoding delay at the destination since the channel decoders 
are run serially. In this example three decoder applications are 
required in series but the extra delay will only grow as the 
principle is extended to more complex networks. 

C. Joint network and channel decoding by defining a joint 
code 

In joint network and channel decoding we define a single 
error correction code which incorporates the structure in the 
individual channel codes and the structure in the network. In 
Section |IV] we will see two ways to achieve this using LDPC 
codes, by defining a code on the joint codeword consisting of 
all received packets, or the joint codeword consisting of all 
transmitted and all received packets. 

III. Capacity and Achievable Rate Regions 

In this section we analyze the achievable rates (in the Shan- 
non sense) of the decoding strategies, without any assumption 
on the code structure, meaning that the channel codes are not 
necessarily LDPC. We consider the network depicted in Fig.[T| 
as the simplest network which combines both network and 
error correction coding. The technique presented in this section 
can be easily extended to more complex networks. 

Each link i — > j from node i to node j, is a binary 
symmetric channel (BSC) with crossover probability pi j. The 
packet transmitted over link i — > j, is j, and the received 
vector is denoted j . We assume that the channels for the 
links are independent, time invariant, and memoryless. For the 
BSC i j, the transition probability function is given by 



Pr{Fij = m\Xi^j = m} = 1 ~ pi^ 
Pr{y,^j = = m} = 



TO €{0,1} (la) 

TO, n G {0, 1}, n 7^ TO. 

(lb) 
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In words, with probability pij the input symbol is received in 
error. Equivalently, we can write 

= Xij e (2) 

where Eij € {0, 1} and Pr{i?jj = 1} = Pi,j- 

In this network, nodes 1 and 2 are sources for messages A 
and B respectively, but with the constraint that X13 — Xi 4. 
This captures the fact that all source messages are encoded 
only once, at their respective source nodes, and are not de- 
coded (and re-encoded) except at the destination. We let A and 
B be independently, randomly, and uniformly chosen from the 
message alphabets {1,2,..., [2"^^ J} and {1,2,..., [2"-^«J } 
respectively, where n is the block length of the channel codes 
for all channels. The aim is to send both A and B to node 4 
in n channel uses on each link. We use A and B to denote 
the estimates for A and B respectively at node 4. The rate 
pair {Ra,Rb) is achievable if Pi{{A,B) ^ {A,B)} can be 
made arbitrarily small. Node 4 can reliably decode A (or B) 
iff Ra (or Rb) is achievable. The capacity is defined as the 
set of all achievable rates. 

It can be easily shown that the capacity of the BSC i j 

is 

=l-i/(p,,,), (3) 

where = -pijlogp^^j - (1 - Pij)log(l - Pij)- 

The capacity is achieved with an equiprobable channel input 
distribution p{xij). 

Transmission at sources: Nodes 1 and 2 send codewords 
for messages A and B respectively: node 1 sends Xi 3(A) e 
{0,1}" and Xi^4(^) e {0,1}" on links 1 ^ 3 and 1 ^ 4 
respectively, where Xi,3(yl) — Xi.4(A); and node 2 sends 
X2,3(B) e {0, 1}" on link 2^3. 

Linear codes: We assume that linear codes are used. It has 
been shown by Elias IITtII that the capacity of the BSC is 
achievable by linear codes. 

A. Independent Network-then-Channel Decoding 

Strategy: Node 4 decodes Xi.4(A) from Y1.4. Indepen- 
dently on the other link, it subtracts Yi_4 from Y3 4 and then 
decodes X2.3(i?). 

Theorem 1: The achievable rate region for independent 
network-then-channel decoding, Anc, is the convex hull of all 
{Ra,Rb) satisfying 

Ra < Ci,4 (4a) 
Rb < C". (4b) 

Here, Ci_4 = 1 — i?(pi,4) is the capacity of link 2 — > 6, and 
C" = 1 — H{p") is the capacity of a BSC with cross-over 
probability p" given in (|6|. 

Proof: As message A is decoded from Yi 4, we see a point- 
to-point BSC A'1.4 YiA. So, we have (|4a]). 

By subtracting Yi,4 from Y3.4, we get 

Y3,4®Yi,4 = X2,3(i3)©E", (5) 

where E" = Ei^3eE2,3eE3,4©Ei^4. This can be viewed as 
an equivalent BSC ^2.3 Y, where Y = ¥3.4 © 1^1,4, with 



cross-over probability p", where 

p" = Pi{E" = 1} 

= Pl,3(l -P2,3)(l -P3,4)(l -Pl,4) 
+ (1 - Pi, 3)^2,3(1 -P3.4)(l -Pl,4) 
+ (1 - Pl,3)(l - ^2,3)^3,4(1 - Pia) 
+ (1 -Pl,3)(l -P2,3)(l -P3,4)Pl,4 
+ (1 -pi,3)P2,3P3,4Pl,4 +Pl,3(l - P2,3)P3,4Pl,4 
+ Pl,3P2,3(l -P3,4)Pl,4 +Pl,3P2,3P3,4(l -Pl,4)- (6) 

So, node 4 can reliably decode message B from Y3.4 © Yi 4 
up to the rate in ( |4b| i. 

Now, since the rate pair {Ra,Rb) — (Ci_4,C") is achiev- 
able, so are the rate pairs (C1.4, 0) (by switching node 2 off), 
(0, C") (by switching node 1 off), and (0,0) (by switching 
both nodes 1 and 2 off). By time sharing among any three of 
these rate pairs, any rate pair in the convex hull of (Ra,Rb) 
satisfying (|4a]l and (|4b]) is achievable. ■ 

Remark 1: Note that the order of network and channel 
decoding can also be reversed to give an independent channel- 
then-network decoding. This strategy was considered in our 
previous work ||20l . In channel-then-network decoding, node 4 
first performs channel decoding independently on links 1 — > 4 
and 3 — > 4 to obtain Xi,4(A) and X(A, B) respectively, 
where X(A, B) is a codeword that is a function of the 
messages A and B, which is a result of the bit-wise XOR 
operation performed at node 3. Node 4 then performs network 
decoding to obtain message B from Xi 4(^) and X(A, _B). 
This strategy is not considered in this paper as the codebook 
{X(A, i?)} defined for the combined messages A and B is, 
in general, not guaranteed to have the properties required for 
efficient decoding for the LDPC implementation in Section 
HY] 

B. Serial Network and Channel Decoding 

Strategy: Node 4 first decodes message A from link 1 — 4. 
It then reconstructs Xi_3(j4) and subtracts it from Y3.4 before 
decoding message B. 

Theorem 2: The achievable rate region for serial decoding, 
Aseriab IS the convex hull of all {Ra,Rb) satisfying 

Ra < Ci,4 (7a) 
Rb < C, (7b) 

where C = 1 — H{p') is the capacity of a BSC with cross-over 
probability p' given in (|9]). 

Proof: As message A is first decoded from Yi 4, we get 

By subtracting Xi,3(^) from Y3 4, we get 

Y3,4®Xi,3(A) =X2,3(S)®E', (8) 

where E' = E1.3 E2,3 ffi E3^4. By doing this, we get an 
equivalent BSC with cross-over probability 

Pr{E' = 1} =p' = [(1 -pi.3)(l -P2,3) + P1,3P2,3]{P3a) 
+ [Pl,3(l -P2,3) + (1 -Pl,3)P2,3](l -P3,4)- (9) 

So, node 4 can reliably decode message B if ( |7b] i is satisfied. 
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C. Joint Network and Channel Decoding 

Strategy: Messages A and B are jointly decoded from the 
received messages Yi 4 and Y3 4. 

Theorem 3: The achievable rate region for joint decoding, 
Ajoint, is the convex hull of all {Ra,Rb) satisfying 

< Ci,4 + C' -C" (10a) 
Rb < C (10b) 
+ < Ci,4 + C', (10c) 

where C" = 1 — H{p") is the capacity of a BSC with cross- 
over probability p" given in (|6]l, and C — 1 — H{p') is the 
capacity of a BSC with cross-over probability p' given in (|9|. 

Proof: By doing joint decoding, we see a multiple-access 
channel [ 18j. |.19,| from Xi and X2.3 to Y4, where Xi ~ 
Xi 3 = Xi 4 and Y4 — (^1,4,^3, 4)- Hence, we have the 
following capacity region: 



0.8- 



Ra < HX,;Y4\X2^3) 
Rb < I{X2,3;Y4\Xi) 

Rb<HXi,X2X,Y4), 



(11a) 
(lib) 
(11c) 



maximized over all possible p{xi,X2.3)- The capacity region 
can be attained by independent and equiprobable Xi and X2,3- 
Evaluating the RHS of ( |lla[ ) gives 

I{Xi^3, Xi^4; Yi,4, il5,4|-^2,3) 

- H{Y,^4, 1"3.4|^2.3) - H{Y,,4, ^3,41^2,3, Xi,3, X,,4) 

(12a) 

= (1 + H{p")) - (i?(pi,4) + H{p')) (12b) 
= Ci^4 + C'-C" (12c) 

where p" is given in (|6]l, and p' is given in (|9]l. 
Next, evaluating the RHS of ( |1 lb| l gives 

I{X2,3'i ^3,4, ^1,4|-''^1,37 X14) 

= I{X2,3;Y3a\Xi^3,Xia) (13a) 

= /(^2,3; 5^3,41^1.3) (13b) 

= C'. (13c) 

( |13a[ l is because given (Xi_3,Xi_4), ^2,3 and li_4 are inde- 
pendent, as Yi_4 — Xi 4 — {Xi^3, X2^3) forms a Markov chain. 
( fT3cl i follows from and (|7fe]). 

Finally, evaluating the RHS of ( |llc[ ) gives 



-^(-'^1,4, -^^1,31 -^^2,3; ^1.4, ^3,4) 

= H{YiA,Y3,4) - H{Y4^4,Y3a\Xia,Xi^3,X2,3) 

= 2-{H{p4A + H{p')) 

= Ci4 + C'. 



D. Comparison 

Theorem 4: The achievable rate regions satisfy 



■^nc ^ -^serial ^ -Ajoint 
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(14b) 
(14c) 
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Fig. 2. Achievable rate regions for different decoding schemes for BSC, 
Pi i = 0.05 for all links. 



Proof: We can show that 

P"-P' = (1 - 2pi.3)(l - 2p2,3)(l - 2P3.4)P1,3 > 0. (16) 

The inequality above is because < Pi, 3,^2,3,^3, 4 < ^- It 
can be shown by induction that < p',p" < \. This means 



(15a) 
(15b) 



H{p") < H{p') and C" < C. So, the constraint ^ is at 
least as strict as the constraint ( fTb] ), and hence Anc C Aseriai- 

Lastly, the constraint ^7a\ is at least as strict as the constraint 
( |lla| i because C > C". Summing (|7a]) and (|7b]i gives ( |llc| i. 

So, Aseriai ^ Ajoint- ^ 

For example. Fig. |2] shows the achievable rates of the three 
decoding strategies when pij = 0.05 for all links. 
In summary, we have the following comparison: 

1) Serial decoding has the same rate region as network- 
then-channel decoding for Ra because both decodes A 
from Yi 4. But serial decoding can improve Rb over 
network-then-channel decoding as it subtracts a clean 
(decoded) version of ^i^3{A) from the received message 
Y3_4, and thus cancels the interference from message A 
before decoding message B. For network-then-channel 
decoding, a noisy version of Xi 3(A) is subtracted from 
Y3.4, and while the interference from message A is 
removed, additional noise is also introduced at the same 
time. 

2) Joint decoding can improve Ra because node 4 decodes 
message A from both Yi 4 and Y3 4 in joint decoding 
but solely from Yi 4 in all other schemes. Joint decoding 
does not improve Rb over serial decoding as only 
Y3 4 carries information about B. Upon canceling the 
interference by A when decoding B, serial decoding 
akeady obtains the best rate region for B. 

The fact that serial decoding can improve Rb over network- 
then-channel decoding, and that joint decoding can improve 
Ra over both serial and network-then-channel decoding is also 
true for other channel models, for example the additive white 
Gaussian noise channel channel where each relay can only 
forward the summation of the signals it receives, scaled to 
account for constraint on the relay transmit power. 

While it is not surprising that joint decoding performs better 
than serial decoding, which in turn outperforms independent 
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decoding, the rate region characterization allows us to analyze 
the improvement of individual source data rates. It is inter- 
esting to see that serial decoding is actually able to achieve 
a segment on the capacity boundary in this example. This 
suggests that if a node in the network only needs to decode 
data from selected sources, it may not lose much performance, 
as far as achievable rate is concerned, by considering the links 
independently. However, if the node is to decode the data from 
all the sources, performing independent or serial decoding 
results in a significant performance loss. 

IV. Joint Network and Channel Coding Using 
Low-Density Parity-Check Codes 

In this section we consider how to combine network and 
channel coding using low-density parity-check codes and joint 
iterative decoding. LDPC codes are block codes described 
by a sparse parity-check matrix 11211 . Il22ll first presented by 
Gallager in 1962. Gallager proposed an iterative decoding 
algorithm, now called sum-product decoding, which utilizes 
the sparsity of the parity-check matrix to decode iteratively 
with complexity linear in the code length. Using sum-product 
decoding, LDPC codes have been shown to perform remark- 
ably close to the Shannon limit on many channels ll23l . Il24l . 

A length n LDPC code is designed by specifying a sparse 
mxn parity-check matrix H, and the code dimension is fc = 
n — rank2(-ff). In most cases rank2(i/) ~ m and ?- = 1 — ^ 
is called the design rate. A generator matrix for the code can 
be found using Gauss-Jordan elimination on H or encoding 
can be performed directly from H in some cases. 

An LDPC code is {wc, Wr) -isgnlar if all the columns of H 
have Wc non-zero entries and all of the rows of H have Wr non- 
zero entries. A Tanner graph, |25|, displays the relationship 
between codeword bits and parity checks in H. Each of the 
n code bits, and m parity checks in H are represented by a 
vertex in the graph. A graph edge joins a code bit vertex to 
the vertices of the parity checks that include it. A cycle in a 
Tanner graph is a sequence of connected code bits and parity 
checks which start and end at the same vertex in the graph 
and contain no other vertices more than once. The existence 
of cycles in the Tanner graph are well known to hinder the 
performance of the sum-product decoding algorithm (see e.g. 
||22|) and most LDPC codes are designed to avoid cycles of 
size-4 (called 4-cycles) or less. 

We will propose two joint network and channel decoding 
strategies which combine the parallel decoding advantages of 
independent decoding and improve upon the error correction 
performance of serial decoding by sharing error correction 
information between the channel decoders. 

In our first strategy we define a joint channel code which 
describes the mapping of each transmitted message Ui , . . . , us 
into each of the packets which have been received by the 
destination. In effect we are incorporating the operations of 
the network code into an extended channel code. 

For example, a destination which receives Ci and (such 
as node 4 in the network depicted in Fig. [T]) will define the 
generator matrix for the code which maps Ui, and U2 to Ci 
and Ci C2- For simplicity we will assume that the generator 



and parity-check matrices are in standard form; i.e. the first k 
columns of G (respectively last m — n — k columns of H) 
form akxk (respectively m x m) identity matrix. However, the 
resulting joint matrices apply equally for the non-systematic 
parity-check matrices that are generally defined for LDPC 
codes. 
Let 

Gi = [/(fc),gi] G2^[i{k),g2], 

Hi = [HuI{m)] H2 = [n2,I{m)], 

where Hi is the transpose of Qi, H2 is the transpose of Q2 
and I{k) is the k x k identity matrix, and both code rates are 
the same. Consider a generator matrix Gjoint for a code which 
generates the codeword 



Cjoint 

The first n bits in c 



= [ci,ci ® C2]. 



joint are simply the codeword for Ui 
generated by UiGi and the second set of n bits in Cjoint are 
UiGi ® U2G2. Putting these equations in matrix form gives 
the generator matrix: 



G 



joint 



Gi Gi 
0{k,n) G2 

I{k) Gi I{k) Gi 
0(fc,fc) 0(fc,m) I{k) 02 

where 0(fc, n) is the fc x n all zeros matrix. We can then write 

Cjoint — [Ul, U2] Gjoint- 

Gi and G2 are already in standard form so to put Gjoint 
into standard form involves k row operations where the j-th 
row of Gjoint 1 for 1 < J < fc, is replaced by the modulo-2 
sum of the j-th and k + j-th rows of Gjoint resulting in the 
matrix 



G' — 

gjoint 



I{k) Gi 0{k,k) Gi(SG2 
0(fc,fc) 0{k,m) I{k) G2 



We can then define a joint network / channel parity-check 
matrix for the network by 



joint 



Then 



Hi 

•Hi 0% 

Hi 
Hi®H2 



1(171) 
0(m, to) 

0(to, n) 
H2 



0{m,k) 0(m, m) 



I{m) 



Cjoint-Hj^int = 0(1, 2m), 

and we can jointly decode Ci and Ci^ using iJjoint to give Ci 
and Ci^. The decoded codeword for U2 is then simply 

C2 = ci e 

Note that the generator matrix Gjoint was defined only to 
motivate iJjoint, it will not be employed by the source node 
which encodes Ui and U2 traditionally using Gi and G2. 
Importantly, iJjoint is sparse when Hi and H2 are sparse 
so -f/joint describes an LDPC code. Unlike independent and 
serial decoding, joint decoding enables the decoder to use the 
information in Ci~2 to decode Ci. 
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If Hi and H2 are independent sparse parity-check matrices 
the matrix Hi H2 will have many entries in common with 
Hi (and H2)- This will lead to a significant number of 4-cycles 
in the columns of fljoint which contain both Hi © H2 and Hi 
(and the rows of -f/joint which contain both Hi H2 and H2) 
and 4-cycles are well known to hinder the performance of the 
sum-product decoding algorithm (see e.g. ||22|)- 

To avoid these 4-cycles we can design Hi and H2 so that 
the ith column of H2 contains all but one of its entries in 
common with the ith column of Hi. However, this strategy is 
only practical for networks with a limited number of channel 
codes. An alternative strategy for joint decoding that avoids 
4-cycles in the joint Tanner graph is defined below. 

For the special case where both messages are encoded with 
the same code (i.e. H2 — Hi) the joint parity-check matrix is 



H 



joint 



Hi 0(to, n) 
0(m, n) Hi 



The structure of iJjoint reflects the fact that, since Ci and C2 
are both codewords of the linear code represented by Hi, so 
too is Ci C2- Thus, when the codes used for the messages 
are the same, decoding with iJjoint is actually independent 
channel-then-network decoding rather than joint decoding. 

A. Joint network and channel decoding on an extended Tanner 
graph 

In this strategy a joint Tanner graph is defined for the 
channel and error correction codes. The network coding op- 
erations are simply modulo-2 sums of codewords and so can 
be considered as parity-check equations which constrain the 
bits in the combined packets. The extended Tanner graph 
includes the graphical representation of each of the parity- 
check matrices Hi, ... , Hs as well as bit nodes for each of 
the combined packets, and constraint nodes for each of the 
network coding operations. 

For example, a destination which receives Ci, Ci^ and CiTa 
will form a Tanner graph which describes each of the parity 
check matrices Hi, H2 and H^, includes bit nodes for all of 
the bits in Ci 2 and Ci 3 and parity-check nodes for each of 
the network coding operations 



Cl,2 = Ci 



C2, 



Cl,3 Ci ® C3. 

Fig. [3] shows an extended Tanner graph at the destination 
node which can be used to find Ci, C2, and C3 when cl, Ci^ 
and Ci^ are received. The a priori bit LLRs for the bits not 
received directly by the destination node are set to zero. 

Different schedules can be used to decode the extended 
Tanner graph but we will use a schedule of message passing 
decoding where one iteration of the decoder corresponds to 
all of the bit nodes (for the codewords and combined packets) 
updated in parallel and all of the check nodes (for the channel 
codes and network codes) updated in parallel. Note that this 
method of joint decoding for the butterfly network was first 
presented in an earlier conference version of this paper ll26l 
and independently, with slightly different scheduling, in ll27l . 




Fig. 4. The butterfly network 



V. An Example - The Butterfly Network 

In this example we will consider the butterfly network of 
Fig. |4](see, e.g. |1|). Each link i — > j from node i to node j, 
is a binary symmetric channel with crossover probability pi j. 
The source, node 1, generates two binary messages u^i and 
ub. The codewords for messages u^i and are ca — uaGa 
and cb = vlbGb, respectively. We also define a code Cab 
which consists of the set of codewords 

cab^ca®cb Vc^,Cb. 

The codeword is transmitted over link 1 — > 2, i.e. 
X1.2 ~ Ca and the codeword cb is transmitted over link 
1 — > 3, i.e. Xi 3 = Cb- Each of the nodes 2, 3, and 5 simply 
forward on the vector they receive, no processing is done of 
any kind; i.e. X2,6 — X2,4 = Yi,2 etc. Node 4 performs 
network coding by combining the packets at its input 



X4^5 — Y2,4 ® Y3_4 

= C^©Ei 2®E2.4©CBi 



E 



1,3 ' 



>E 



3,4- 



No channel decoding is performed so X4 5 can be thought of 
as a noisy version of the codeword cab- 

The destination, node 6, knows which channel codes have 
been used and has available Y2.6 = ca, a noisy version of 
c^, and Ys e = cab, a noisy version of the codeword cab- 
(Although we only consider node 6, an identical argument 
applies to the node 7.) As the nodes can only re-transmit 
the binary vector detected at their input (or XOR two such 
binary vectors) errors added by the links will occur as flipped 
bits. Thus for networks which transmit over more general 
memoryless channels the network can still be modeled using 
binary symmetric channels for the links. 

We can define a joint network / channel parity-check matrix 
for the butterfly network between nodes 1 and 6 by 



H 



joint 



Ha 0{m,n) 
Ha ffi Hb Hb 
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and so we can jointly decode and cab using ^/joint to give 
and Cab- The decoded codeword is then simply 



cb=ca® Cab- 



(17) 



For the extended joint network and channel coding we 
define the extended codeword as the concatenation of the 
codewords ca, cb, and c^^: 

Cextn = [ca,Cb,Ca ® Cfi]. 

It is easy to see that such codewords must satisfy the parity- 
check matrix 

Ha 0(171, n) 0{m,n) 
-ffextn = 0(m, n) Hb 0(m, n) 
I{n) I{n) I{n) 

The relationship cab — ca ® cb is represented by the last 
n parity-check constraints in i?extn- ^^extn will be 4-cycle free 
whenever Ha and Hb are designed to be 4-cycle free. Of 
course, the decoder at node 6 does not have values for a priori 
input probabilities for with which to perform decoding 
using i^extn- This can be easily remedied by passing in a priori 
probabilities p{cB(i) = 1) = p{cB{i) = 0) 0.5 for these 
received bits. 
Then 

Cextn -ffe^tn = 0(1, 2m + n), 

and so we can jointly decode ca and cab using i?extn to give 
Ca and cg; i.e. with this scheme the decoded packets ca and 
cs are returned directly by the joint decoder. 

A. Simulation results 

Different randomly constructed (3, 6)-regular rate- 1/2 LDPC 
codes free of 4-cycles (see e.g. |j22]|, and code from 1281 ) 
are used for the channel codes Ca and Cb as they have 
a sparse parity-check matrix representation with good sum- 
product decoding performances. Random codes are chosen to 
focus on the decoding strategies rather than any effects of a 
particular code design. We use codewords of length 500 bits 
and apply standard sum-product decoding with a maximum of 
20 decoder iterations. For the independent and serial decoding 
schemes this means a maximum of 20 iterations for each 
channel decoder, but for the joint decoding schemes the single 
joint decoder uses a maximum of 20 iterations. So that each 
path is subject to roughly the same level of noise, the links 
each have crossover probability p except for link 2 — > 6 which 
has crossover probability 3p. 



10 f . 

+ Independent 

O Serial 

10"' r □ Joint (Hj„i„,) 

; Joint (HJ 



10"' 



e 10 



10 r 



lO"" 




C.0C2 0.004 0.006 0.008 0.01 0.012 0.014 0.016 0.018 
Link crossover probability (p) 

Fig. 5. Different length-500, rate- 1/2, LDPC codes are used to encode U4 
and ug whicli are transmitted over tlie network in Fig. |4] Sliown is tlie bit 
en'or rate peif ormance of packet A (solid curves) and packet B (dotted curves) 
using the four different decoding schemes. The link crossover probabilities 
are p with the exception of link 2 — > 6 which has crossover probability 3p. 



Fig. |5] shows the error correction performance of the various 
decoding methods when and us are encoded with different 
randomly chosen LDPC codes. We can see that independently 
decoding with the network and then channel codes performs 
as expected, returning poor performances for the decoding of 
ub, since it is corrupted by both the errors on ca and those on 
Cab- Also as expected, using serial decoding or either version 
of joint decoding, cb is only corrupted by the errors from 
that remained after decoding and so the decoding performance 
of Ub is significantly improved. 

Fig. |6] emphasizes the benefit that joint decoding, by using 
the network code as part of a larger error correction code, can 
provide to ua over decoding with the channel code alone. 
In this simulation the network is modified to increase the 
crossover probability on link 2 —> 6 to be 12p- 

Overall, serial decoding performs equally as well as inde- 
pendent network-then-channel decoding for A but improves 
the performance of B- The error rate of the first joint decoding 
scheme however, is poorer than that of serial decoding for A as 
it is hampered by the 4-cycles in the parity-check matrix Hab- 
The extended joint decoding scheme, which is able to jointly 
decode A and B using a cycle free Tanner graph, outperforms 
all the other schemes for both A and B- 
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Link crossover probability (p) ^ ^ ^-3 

Fig. 6. Different length-500, rate- 1/2, LDPC codes are used to encode U4 
and ug whicli are transmitted over tlie network in Fig. |4] Shown is tlie bit 
en'or rate peii'ormance of packet A (solid curves) and packet B (dotted curves) 
using the four different decoding schemes. The link crossover probabilities 
are p with the exception of link 2 — > 6 which now has crossover probability 
12p. 



The network coding performed at node 4 to improve the 
throughput of the network has the unavoidable effect of 
reducing the BER performance of packet B (when compared 
to a network which uses two channel transmissions at node 
4 to send A and B separately to node 5). However, by using 
joint decoding, this loss in performance can be significantly 
reduced and, furthermore, the network coding can even be used 
to improve the BER performance of packet A (compared to 
the network without network coding). 

Although joint decoding can not improve the rate region 
(in the Shannon sense, i.e., rates with error probability ap- 
proaching zero using infinitely long code length) for B over 
that of serial decoding, simulation results show a bit error rate 
performance improvement for B when joint decoding with 
i/extn is used, showing that by jointly decoding A and B 
the convergence performance of B can be improved. Or put 
another way, if ca is received without error there will be no 
improvement in the performance of joint over serial decoding 
of cb ■ However, using a finite code length, where errors occur 
in Cji, joint decoding can remove more errors from so 
fewer errors remain to corrupt c^- Furthermore, the decoding 
of A and B can be successively improved iteratively using a 
better estimate of A to improve the decoding of B and vice 
versa. 

B. Complexity 

The decoding schemes proposed here do not employ chan- 
nel codes in the intermediary network nodes, so the net- 
work complexity remains the same for all decoding schemes. 
Sources A and B are encoded independently by the encoder 
in the same way for each scheme and so the source node com- 
plexity also does not change. The only increase in complexity 
for the joint decoding schemes occurs for the decoder at the 
destination node. 

In general, the complexity of the sum-product decoding 



algorithm is a linear function of the number of non-zero entries 
in the parity-check matrix. For (3, 6)-regular length n LDPC 
codes the total number of non-zero parity-check matrix entries 
in Ha and Hb is 3n each. Thus, the independent and serial 
decoding schemes have a total of 6n non-zero parity-check 
matrix entries (over both matrices), while the joint network / 
channel decoding matrix can have up to Qn additional non-zero 
entries (if there are no entries overlapped in Ha and Hb), and 
the extended joint decoding matrix has 3n additional non-zero 
entries. Thus, using rate half codes the number of non-zero 
parity-check matrix entries is 6n for independent decoding, 
between 6n (common channel codes) and 12n (completely 
disjoint parity-check matrices) for joint decoding and 9n 
for extended joint decoding. For all the schemes, decoding 
complexity remains linear in the block length, and while 
the joint decoding schemes have a slightly higher decoding 
complexity per iteration, their improved performance means 
that fewer decoder iterations are actually required. 

VI. Conclusion 

In this paper we have considered decoding schemes for 
low complexity networks, where a message is encoded at 
the source node and decoded at the destination node, and 
intermediate nodes perform network coding operations, in our 
case modulo-2 addition, but no channel coding. We have inves- 
tigated three potential decoding schemes for the destination: 
(1) independent decoding where the destination decodes data 
from each link independently, (2) serial decoding where the 
destination decodes data from each link independently, but in 
series and by using the knowledge of previously decoded links, 
and (3) joint decoding where the destination jointly decodes 
all data from all the links simultaneously. 

In networks with noisy links and low-complexity interme- 
diary nodes it can still be of benefit to perform a simple net- 
work coding strategy, involving the XOR of (noise-corrupted) 
codewords rather than messages, and use joint decoding at the 
destination to retrieve the transmitted codewords. We saw that 
our proposed decoding scheme improves the error correction 
performance of the network over the independent and the 
serial decoding schemes, and does so without adding network 
complexity. This is achieved by describing a joint network 
code / channel code at the destination node and decoding 
with iterative sum-product decoding. The new schemes require 
no channel coding at the intermediary network nodes and 
only a small amount of additional decoding complexity at the 
destination node. The joint decoding produces improved error 
correction performances and can significantly improve decoder 
speed. 
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